Corpus: eng_newscrawl-public_2018_300K


5.1.18 Words nearly always as next neighbors

Strong next-neighbor (NN) co-occurrences with a low probability of being separated

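The quotient below is calculated as freq(word1)*freq(word2)/NN_freq^2, where NN_freq is the frequency of the two words as next neighbors. A quotient close to 1 means the two words rarely occur apart.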
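
The calculation can be sketched as follows. This is a minimal illustration, assuming the quotient is the product of the two word frequencies divided by the squared next-neighbor frequency, which is what the values in the table are consistent with. The function name `nn_quotient` is hypothetical and not part of any corpus tool.

```python
def nn_quotient(freq_word1: int, freq_word2: int, nn_freq: int) -> float:
    """Return freq(word1) * freq(word2) / NN_freq^2.

    Assumption: nn_freq never exceeds either word frequency. Under that
    assumption, 1.0 means both words occur only as this adjacent pair,
    and larger values mean the words also occur apart from each other.
    """
    if nn_freq <= 0:
        raise ValueError("nn_freq must be positive")
    return freq_word1 * freq_word2 / nn_freq ** 2


# Counts taken from the table rows for "Los Angeles" and "Buenos Aires".
print(round(nn_quotient(436, 323, 319), 2))  # 1.38
print(round(nn_quotient(21, 21, 21), 2))     # 1.0
```

Rounding to two decimals matches the precision shown in the quotient column.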

Word 1 Word 2 Frequency of word 1 Frequency of word 2 Frequency as NN Quotient
Los Angeles 436 323 319 1.38
Hong Kong 139 114 107 1.38
Jong Un 75 67 59 1.44
Palo Alto 53 58 46 1.45
Boko Haram 64 55 52 1.30
Notre Dame 50 50 45 1.23
Tel Aviv 63 48 47 1.37
Planned Parenthood 40 37 36 1.14
Fuerza Pública 17 24 17 1.41
Kuala Lumpur 26 23 23 1.13
GLOBE NEWSWIRE 21 23 21 1.10
Buenos Aires 21 21 21 1.00
Login to your account 21 20 18 1.30
alma mater 20 19 19 1.05
Suu Kyi 17 17 15 1.28
Phnom Penh 17 15 15 1.13
Recep Tayyip 16 15 14 1.22
ROCKY MOUNT 11 12 11 1.09
San Luis Obispo 17 12 12 1.42
Mardi Gras 10 11 10 1.10
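
A table of this shape could be derived from tokenized text roughly as sketched below. This is an illustration only: how the corpus was actually tokenized and counted is not stated here, and the function `nn_table`, its `min_nn` parameter, and the sort order are all hypothetical choices.

```python
from collections import Counter


def nn_table(tokens: list[str], min_nn: int = 1) -> list[tuple[str, str, int, int, int, float]]:
    """Build rows of (word1, word2, freq1, freq2, nn_freq, quotient).

    Assumption: a "next neighbor" is simply the token that directly
    follows another token in the input sequence.
    """
    word_freq = Counter(tokens)
    # Count every adjacent (left, right) token pair.
    pair_freq = Counter(zip(tokens, tokens[1:]))
    rows = []
    for (w1, w2), nn in pair_freq.items():
        if nn < min_nn:
            continue  # skip pairs that are too rare to be informative
        quotient = word_freq[w1] * word_freq[w2] / nn ** 2
        rows.append((w1, w2, word_freq[w1], word_freq[w2], nn, quotient))
    # Hypothetical ordering: most frequent pairs first, then lowest quotient.
    rows.sort(key=lambda r: (-r[4], r[5]))
    return rows


tokens = "He flew to Los Angeles and then left Los Angeles for Hong Kong".split()
for row in nn_table(tokens, min_nn=2):
    print(row)  # ('Los', 'Angeles', 2, 2, 2, 1.0)
```

A real corpus would also need a threshold on the quotient to keep only pairs that are nearly always adjacent; the cutoff used for the table above is not given.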